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I  Introduction 

The  purpose  of  this  survey  was  to  begin  to  develop  a  worthwhile  program 
for  estimating  city  and  town  populations  in  Montana.  Since  no  precedent 
existed  for  establishing  such  a  program  it  was  decided  to  find  out  what 
other  state  governments  were  making  city  and  town  estimates,  and  how. 
A  letter  was  sent  to  each  state  governor's  office  requesting  information 
on  any  city  estimation  procedures  currently  being  used  within  their 
state  government.  The  response  to  these  inquiries  was  a  gratifying  86 
percent.  While  many  states  did  not  have  an  existing  program,  some  had 
advanced  and  detailed  programs.  The  final  results  of  the  survey  are 
published  in  Part  III  of  this  report.  Since  it  would  be  difficult  to 
talk  about  procedures  used  to  estimate  local  area  population  without 
some  type  of  background  in  demography,  an  attempt  is  made  in  Part  II  of 
this  report  to  give  the  reader  a  sort  of  primer  on  estimating  techniques. 
Most  of  the  material  in  Part  II  has  been  abstracted  from  three  principal 
sources.  These  are:  The  Methods  and  Materials  of  Demography,  Vol.  II, 
1971,  U.  S.  Bureau  of  Census;  Procedures  for  Processing  Population  Estimates 
and  Projections  for  State  Tax  Sharing  in  Wisconsin,  Wisconsin  State 
Bureau  of  Planning  and  Budget,  1973;  and  Population  Trends  1974,  Office 
of  Program  Planning  and  Fiscal  Management,  State  of  Washington. 

The  final  part  of  this  report  is  a  bibliography  of  sources  on  methods 
and  procedures  used  in  small  areas  estimation.  This  listing  is  a 
composite  of  bibliographies  obtained  from  the  survey. 


II  Estimating  Techniques 

The  most  complete  and  reliable  source  of  population  information  is  the 
decennial  census,  based  on  a  house-to-house  enumeration.  However,  since 
population  distribution  is  constantly  changing  it  is  not  always  adequate 
to  use  five-  or  ten-year-old  statistics  based  on  a  previous  census.  The 
Bureau  of  Census  itself  is  in  the  process  of  developing  local  area 
population  estimates  for  revenue  sharing  since  1970  census  counts  do  not 
reflect  current  population  adequately.  It  is  important  to  note  then 
that  population  estimates  are  not  complete  counts  but  only  substitutes 
for  them.  It  is  only  by  accident  that  an  estimate  will  ever  be  exactly 
equal  to  a  true  count;  however,  one  would  always  like  to  choose  the 
methods  that  yield  estimates  close  to  the  true  counts. 

The  major  advantage  of  estimates  over  actual  counts  is  lower  costs.  It 
would  be  far  more  expensive  to  do  a  complete  census  of  a  county  or  city 
than  to  collect  good  symptomatic  data  for  estimating  the  population. 
While  some  data  might  be  rather  expensive  to  collect  (the  Census  Bureau 
estimated  that  it  cost  $.10  a  record  for  76  million  records  to  estimate 
local  population  from  IRS  data),  other  sets  of  data  exist  just  for  the 
asking.  Birth  and  death  records  for  cities  and  towns  are  almost  always 
available  through  the  State  Department  of  Health  or  the  National  Center 
for  Health  Statistics.  School  enrollment  data  can  usually  be  obtained 
at  the  State  Educational  Department.  Likewise,  automobile  registration 
data  is  sometimes  available  through  the  State  Highway  Department. 


Enrollment  and  automobile  registration  data  are  good  examoles  of  symptomatic 
data  on  population  change,  and  if  they  are  automated  (available  on 
computer  tapes  or  cards)  they  are  particularly  appealing.  However,  to 
be  useful  it  is  almost  always  necessary  that  the  above  data  be  available 
for  a  "base  year"  --  usually  the  census  year  of  1970.  It  is  even  more 
rewarding  if  the  data  is  available  for  earlier  censal  periods  as  this 
allows  one  to  check  the  estimates  for  accuracy.  Types  of  data  and  their 
application  to  local  area  estimation  are  covered  in  the  last  part  of  the 
methodology  section. 

Estimates  can  be  divided  into  several  types  on  the  basis  of  time  reference 
and  the  basis  of  their  derivation.  Demographers  generally  classify 
estimates  into  one  of  three  categories.  These  are: 

(1)  Intercensal  Estimates  --  an  estimate  of  a  population  between 
two  censuses. 

(2)  Postcensal  Estimates  --  an  estimate  of  a  population  following 
the  latest  census. 

(3)  Projections  --  an  attempt  to  predict  the  population  of  an  area 
at  some  future  time. 

The  term  "population  estimate"  generally  refers  to  the  estimates  defined 
in  (2),  while  the  term  "population  orojection"  always  refers  to  the 


third  type  of  estimate.  Projections  are  made  by  constructing  mathe- 
matical models  to  reflect  the  operation  of  the  vital  events  --  birth  and 
death  --  and  to  reflect  the  less  predictable  phenomenon  of  migration. 
Naturally,  the  number  produced  by  a  projection  method  is  quite  sensitive 
to  the  assumptions  regarding  births,  deaths  and  migration  which  are 
built  into  the  model . 

Postcensal  estimates  can  generally  be  accomplished  by  using  procedures 
and  models  tied  to  actual  current  data.  The  growth  or  decline  of  an 
area's  population  affects  such  things  as  school  enrollment,  automobile 
registration,  and  the  number  of  births  and  deaths  in  an  area.  In  other 
words,  to  make  population  estimates  one  should  use  symptomatic  data  on 
variables  that  are  strongly  correlated  with  population  size.  The  key 
then,  to  estimating  local  area  population,  is  to  develoo  some  way  of 
determining  or  estimating  the  relationships  between  the  symptomatic  data 
and  the  population  size. 

The  next  section  contains  descriptions  of  methodologies  currently  being 
used  by  state  governments  to  estimate  local  area  copulation.  While 
county  populations  are  being  estimated  by  the  Bureau  of  Census,  no 
federal -cooperative  program  exists  at  this  time  for  city  and  town 
estimates.  The  purpose  of  the  next  section  is  to  give  some  elementary 
insight  into  current  methodology  being  used  to  estimate  city  and  town 
populations. 


Methodology 

Methods  for  estimating  local  area  populations  are  usually  classified  as 
one  of  four  types.  These  are: 

(1)  Component  Methods 

(2)  Composite  Methods 

(3)  Censal-Ratio  Methods 

(4)  Ratio-Correlation  Methods 

At  the  city  and  town  level  the  use  of  one  methodology  or  another  is  most 
often  dictated  by  data  availability.  Should  one  be  fortunate  enough  to 
be  able  to  choose  between  types  of  methods  to  use,  a  test  and  evaluation 
system  should  be  set  up  to  check  estimates  against  actual  counts. 

(1)  Component  Methods 

Component  methods  use  current  symptomatic  data  to  measure  net  migration 
which  is  combined  with  birth  and  death  statistics  to  estimate  population 
change.  The  general  equation  can  be  presented  as: 


P2  =  P,  +  B  -  D  +  NM 

Pp  =  current  population 

P.  =  base  population  (1970  census) 

B  =  number  of  births  between  P;^  and  P2 

D  =  number  of  deaths  between  P^  and  Pg 

NM  =  net  migration  between  P^  and  P^ 


where  P^  can  usually  be  obtained  from  the  Bureau  of  Census  while  the 
number  of  births  and  deaths  can  be  obtained  from  the  State  Department  of 
Health  or  the  National  Center  for  Health  Statistics.  Current  data  on 
net  migration  is  hardly  ever  available  and  thus  net  migration  for  the 
area  must  usually  be  estimated  using  symptomatic  data.  The  equation 
then  becomes: 


A.  A 

P2  =  Pi  +  B  -  D  +  NM 


A     A 

where  P^  and  NM  are  estimates  of  current  population  and  net  migration. 

The  most  comnon  known  variant  of  the  component  methods  is  the  Component 
Method  II  developed  by  the  Bureau  of  Census.  In  this  method  school 
enrollment  is  used  to  measure  net  migration.  Other  types  of  data  which 
have  proved  useful  or  may  prove  useful  in  estimating  net  migration  for 
small  areas  include  data  from  the  family  allowance  system,  the  social 
security  system,  tax  records,  automobile  registration,  voter  registration 
or  any  other  data  resulting  from  registration  which  would  symptomize 
changes  in  population. 

(2)  Composite  Methods 

The  composite  method  uses  data  from  a  number  of  sources  to  estimate 
various  age  groups  of  the  local  population.  An  estimate  is  then  derived 
for  the  entire  population  of  the  area  by  summing  the  independent  age 
group  estimates.  For  example,  the  number  of  deaths  of  persons  ^5  years 
old  and  over  by  age  may  be  used  to  estimate  the  population  in  this  age 


range;  the  number  of  births  may  be  used  to  estimate  the  number  of 
females  in  the  child  bearing  ages  (18  to  44  years),  which,  in  turn,  may 
be  used  to  estimate  the  number  of  males  in  this  age  range;  school  enrollment 
data  may  be  used  to  estimate  the  population  of  school  ages  (5  through  17 
years  old);  and  the  number  of  births  in  the  previous  5-year  period,  in 
conjunction  with  school  enrollment  data,  may  be  used  to  estimate  the 
population  under  5  years  of  age.  Tax  records  and  voter  registration 
records  may  also  be  used  as  a  source  of  data.  The  most  widely  used 
variation  of  the  composite  method  is  the  Boque-Duncan  Method.  Essentially, 
this  method  is  used  to  compute  local  area  estimates  in  the  following 
manner: 

a.  Estimate  the  45  and  over  population  by  dividing  age-sex-color 
specific  death  rates  into  corresponding  death  registration  for 
the  estimate  year. 

b.  Estimate  females  15  through  44  years  old  using  birth  statistics. 
Tabulated  by  age  and  color  of  mother. 

c.  Compute  the  male  population  15  through  44  years  old  using  age- 
color  specific  sex  ratios  in  conjunction  with  the  female 
population  estimated  in  step  (b). 


Use  school  enrollment  data  to  estimate  the  5  through  14  year 
olds.  This  is  accomplished  by  taking  the  school  enrollment 
for  the  estimate  year  and  dividing  by  the  rate  of  school 
enrollment  for  this  particular  age  group. 


e.  Estimate  the  population  under  5  years  old  by  means  of  color 
specific  fertility  ratios  (the  ratio  of  children  under  five  to 
females  20  through  34),  adjusted  to  reflect  national  trends. 

f.  Sum  the  age-sex-color  specific  estimates  produced  in  steps  (a)- 
(e)  to  yield  a  composite  estimate. 

The  major  drawback  to  this  as  well  as  other  composite  methods  is  the 
vast  amount  of  data  needed.  In  small  areas  one  is  fortunate  to  have 
even  a  small  part  of  the  data  needed  for  making  composite  estimates. 
This  makes  the  next  types  of  estimates  Mery   appealing  since  not  only  are 
they  easiest  to  perform  but  also  they  use  one  set  of  data  to  estimate 
the  population  directly. 

(3)  Censal-Ratio  Methods 

Censal -ratio  methods  use  current  symptomatic  data  to  estimate  total 
population,  or  total  population  change,  directly  rather  than  merely  the 
component  of  migration.  Basically,  in  utilizing  this  method  one  computes 
the  ratio  of  the  symptomatic  data  to  the  total  population  at  the  census 
date,  extrapolates  this  ratio  to  the  estimate  date  and  divides  the 
value  from  the  symptomatic  series  at  the  estimate  date,  by  the  ratio, 
yielding  an  estimate  of  total  population.  More  often  than  not  the 
extrapolated  ratio  is  assumed  to  remain  constant  from  the  census  data  so 
that  the  procedure  is  further  simplified. 


Many  series  of  data  have  been  considered  useful  as  "symptomatic"  series. 
This  list  includes  school  enrollment  or  school  census  data,  electric, 
gas  or  water  installations  or  customers,  volume  of  bank  receipts,  volume 
of  retail  trade,  number  of  building  permits  issued,  number  of  residential 
postal  "drops,"  voter  registration,  welfare  recipients,  auto  registration, 
birth  and  death  statistics,  and  tax  returns.  Variants  of  the  censal- 
ratio  method  are  usually  classified  by  the  types  of  data  that  are 
employed  in  making  the  estimates.  Some  of  the  variants  are; 

a.   Vital  rates  method.  This  method  uses  birth  and  death  rates  to 
arrive  at  two  independent  population  estimates  which  are  then 
averaged  to  derive  a  single  composite  estimate.  The  general 
equations  for  the  vital  rates  method  at  the  town  or  city  level 
would  be: 

TPB2  =  (CBRl  •  TB2)  t  {CBR2  •  TBRl) 

where 

TPB2  =  population  estimate  of  the  town  or 
city  based  on  birth  rates. 

CBRl  =  County  birth  rate  during  census  year. 

TB2  =  number  of  births  in  the  town  or  city 
during  the  estimate  year. 


CBR2  =  county  birth  rate  during  estimate  year. 

TBRl  =  town  or  city  birth  rate  during  the  census 
year. 

An  estimate  based  on  death  rates,  TPD2,  is  then  compiled  in  a 
similar  manner  and  the  two  estimates  TPB2  and  TPD2  are  then 
averaged  for  a  final  estimate.  As  would  be  expected  in  towns 
and  cities  with  small  population,  this  type  of  estimate  is  not 
at  all  stable.  This  problem  can  be  overcome  somewhat  by 
averaging  the  birth  and  death  data  over  a  three-year  period. 

b.   Simple-ratio  method.  Estimates  for  local  areas  (cities)  are 
computed  by  using  population  estimates  for  larger  areas 
(counties)  and  assuming  constant  city-county  growth  ratios 
from  the  census  years.  This  relationship  can  be  represented 
by  the  following  equation  for  a  city  or  town: 

PT2  =  PTl  •  PC2  T  PCI 

where 

PT2  =  population  estimate  of  the  city  or  town. 

PTl  =  population  of  the  town  or  city  during  the 
census  year. 

PC2  =  population  estimates  of  the  county. 
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PCI  =  population  of  the  county  during  the  census 
year. 

As  revealed  in  the  above  equation,  this  is  the  most  widely  used 
and  simplest  method  of  obtaining  local  area  estimates.  The 
assumption  that  a  town  or  city  is  growing  at  the  same  rate  as 
the  parent  county  is  rarely  valid.  However,  in  some  instances 
because  of  the  lack  of  data,  this  is  the  only  type  of  estimate 
that  is  possible. 

c.   Housing  units  method.  Some  indicators  reflect  changes  in  the 
number  of  housing  units  or  households  rather  than  in  the 
number  of  individuals.  These  indicators  include  the  number  of 
building  permits  issued  within  a  local  area,  electric,  gas,  or 
utility  customers,  residential  postal  delivery,  etc.  The 
housing  unit  method  uses  these  types  of  data  to  estimate  the 
number  of  housing  units  or  households  in  the  estimate  year 
which  in  turn  is  used  to  estimate  total  population.  The  basic 
formula  for  estimating  the  current  population  in  households, 
without  allowance  for  change  in  average  size  of  household,  is: 


P2  =  (H  +  U)  -  Pj 
H~ 


where 


P2  -  population  in  households  on  estimate  date 


H  -  occupied  housing  units  on  the  last  census  date 
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U  =  net  increase  in  occupied  housing  units  between 
census  date  and  estimate  date 


P  =  population  in  households  on  the  last  census 
date 


If  it  is  felt  the  vacancy  or  occupancy  rates  are  not 
the  same  at  the  estimate  date  as  they  were  at  the 
census  date,  the  P^  is  adjusted  accordingly. 

d.   Ratio -difference  method.  This  censal  ratio  procedure  uses  a 
difference  estimator  to  update  the  ratio  of  symptomatic  data  to 
population  from  the  base  year  to  the  time  for  which  the 
estimate  is  desired.  As  stated  earlier,  the  estimated  popu- 
lation for  an  area  is  given  in  the  following  equation 


P2  =  S2 
R2 


where 

Pp  =  current  population  estimate  for  small  area 

$2  =  current  value  of  symptomatic  data 

Rp  =  extrapolated  or  updated  ratio  of  the  value 

of  the  symptomatic  data  at  the  base  date  to  the 
population  figure  at  that  same  date. 
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In  the  ratio-difference  method  R2  is  estimated  by  using  the 
value  of  Ro  at  the  base  date,  say  R|,  and  updating  it  with 
current  and  past  ratios  at  a  level  (county)  higher  than  that 
of  the  local  area.  For  instance,  if  r^  and  r2  represent  the 
past  and  current  values,  respectively,  of  the  ratios  of  the 
symptomatic  data  to  population  of  the  higher  level  unit, 
the  estimate  of  Rp,  designated  by  R2,  is  given  by  the  difference 
estimator: 

R2  =  R]  +  r^  -  rj 

which  is  to  say  that  the  value  of  R,  is  changed  by  the  same 
rate  as  is  ri.   If  the  local  area  happens  to  be  a  city  and  the 
higher  level  ratios  are  in  terms  of  county  data,  then  the 
ratio  of  symptomatic  data  to  population  at  the  city  level  is 
assumed  to  change  at  the  same  rate  as  at  the  county  level. 
Symptomatic  data  commonly  used  in  this  method  are  automobile 
registration,  voter  registration,  and  other  types  of  personal 
registrations. 

(4)  Ratio-Correlation  Methods 

These  methods  involve  mathematically  relating  changes  in  one  or  more 
indicator  series  to  population  changes  by  using  multiple  linear  regression, 
More  specifically,  a  multiple  linear  regression  equation  is  derived  to 
express  the  relationship  between  the  changes  over  population  counts  to 
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"independent  symptomatic"  variables  highly  correlated  with  these  changes. 
Most  of  the  data  mentioned  in  this  report  can  be  used  as  the  independent 
variable.  The  success  of  the  ratio-correlation  method  depends  upon  the 
accuracy  of  the  underlying  assumption  that  the  observed  statistical  relation- 
ship between  the  independent  and  dependent  variables  during  the  time  series 
will  persist.  Furthermore,  one  must  assume  that  deficiencies  in  coverage 
in  the  basic  data  series  will  remain  constant,  or  change  yery   little  in  the 
estimate  period. 
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